ABSTRACT

The recent deluge of published studies employing
sentences or connected discourse as the unit of study has left
unsolved the question of whether the two types of materials are
essentially similar or importantly different. An understanding of
this issue is crucial to theory, since the existence of major
psychological differences between the comprehension, learning, and
memory of sentence lists and connected discourse would make a unified
theory covering both types of materials exceedingly difficult to
formulate. While offering no final resolution of the issue, the
present paper examines the evidence, considers the implications of
several major theoretical developments, and poses questions amenable
to experimentation. It is hoped that the paper will serve as a
springboard to a higher level of understanding of how people process
these two common types of experimental materials. A list of
references is included. (Author/JB)
Abstract

The recent deluge of published studies employing sentences or
connected discourse as the unit of study has left the question of
whether the two types of materials are essentially similar or importantly
different unsolved. An understanding of this issue is crucial to theory,
since the existence of major psychological differences between the
comprehension, learning, and memory of sentence lists and connected discourse
would make a unified theory covering both types of materials exceedingly
difficult to formulate.

While offering no final resolution of the issue, the present paper
examines the evidence, considers the implications of several major
theoretical developments, and poses questions amenable to experimentation. It
is hoped that the paper will serve as a springboard to a higher level of
understanding of how people process these two common types of experimental
materials.
Sentences in Lists and in Connected Discourse

The past few years have seen a marked trend in the literature of
experimental psychology toward the study of the comprehension, learning, and
memory of connected discourse. The recent surge seems to have
begun with papers by Fillenbaum (1966) and Sachs (1967), but the area is
not exactly new to psychology. Ebbinghaus (1885) himself studied discourse
learning and reported that he needed only one-tenth the number of repetitions
to learn a passage of prose (from "Don Juan") as he did to learn a list of
nonsense syllables equal in length to the number of syllables in the prose.
Binet and Henri (1894) reported a series of studies on discourse memory
with children which showed great insight and anticipated several of the major
themes of recent experimental work.

Although the tradition of discourse experiments is far older, recent
developments have caused the swift formation of a literature and tradition
of experiments on sentences in unrelated lists. The event which marks the
beginning of the heyday of sentence list experiments was the creation of the
discipline of psycholinguistics. Early experiments with sentence lists
(e.g., Mehler, 196[?]; Miller, 1962) and most since then have been concerned
with attempting to test experimentally the psychological validity of
linguistic concepts such as transformational complexity (Chomsky, 1957, 1965)
or Yngve depth (1960).

While the study of sentences qua sentences is certainly justifiable
on its own merits, the relationship between this newer experimental
material and connected discourse remains unclear. Deese (1961) considered
the even broader question of the relationship of experimental materials
"from the isolated verbal unit to connected discourse," but, unfortunately,
his paper was written before the recent deluge of work and was influenced
primarily by work done with approximations to English (Miller & Selfridge,
1950). It is, perhaps, time to reexamine the question in light of more
recent theory and experiment.

The implicit assumption that seems to run through most of the work
in the area is that both sentence lists and connected discourse are
essentially similar psychologically, although discourse, being more complex,
may involve additional processes. This sentiment is reflected in the
post-Chomskian orthodox position that linguistic accounts of the sentence can
be extended to discourse because sentences in discourse can be joined
together by appropriate conjunctions or other connectives to form one
extended hypersentence (e.g. Katz & Fodor, 1963).

While this view is not implausible, there is reason to question it.
Bartlett (1932) warned that the apparent simplicity of experimental
materials, when presented to so complex an organism as a human subject,
does not ensure simplicity of psychological processes. This has recently
proved to be the case with the traditional materials of verbal learning.
The work of Montague, Adams, Prytulak, and others (see Montague, 1972 for
review) shows that such seemingly simple materials as nonsense syllables,
paired-associates, and word lists are often processed and learned in highly
complex and idiosyncratic fashion. Therefore, it is surely risky to
simply assume that learning sentences in lists is similar to and simpler
than learning connected discourse.

The purpose of this paper is to examine the question of whether the
processes involved in learning sentences in lists and connected discourse
are truly similar or whether important differences exist. A better
understanding of this relationship is crucial in order to permit the
interpretation of the burgeoning body of literature in the
sentence-discourse area. A better understanding would also help to guide decisions of
future experimenters as to whether an experiment should employ a sentence
list or discourse: if the two are essentially similar with regard to the
psychological processes in question, then such a decision could be made on
the basis of expedience or other considerations, but if there are important
differences, then the decision will have to be made on the basis of an
understanding of those differences. Further, such an understanding should
guide the scope of future theories, whether they should cover both types
of experimental materials or be specific to one or the other. The
understanding gained might even influence the content of future theories by
spotlighting important psychological aspects of the materials.

Since there is a paucity of experimental data on the issue, it will
not be possible to achieve a resolution of the question within the confines
of this paper. Rather, I will attempt to provide background for considering
the question, by reviewing a few important similarities between sentence
lists and discourse, as a backdrop against which to set the discussion of
established and possible differences, which will constitute the bulk of the
paper. This will be followed by a more detailed examination of the
relatively few studies which have directly compared the two materials. After
that, a more explicit statement of the questions involved will be made,
and the issue will be examined in the light of theoretical and empirical
advances in the fields of discourse and memory structure and subject
processing. Finally, specific hypotheses and suggestions for their
experimental investigation will be offered. In this way, it is hoped that the
paper will help move psychology some small distance further down the road
to understanding what happens when people read or hear sentences and
connected discourse.

Similarities Between Sentence Lists and Discourse

Memory for Ideas Versus Memory for Words

Experimental psychology, whether due to the influence of years of
experimentation with nonsense syllables, for which paraphrase is impossible,
or due to the procedural difficulties of gist scoring, has tended to study
verbatim reproduction of sentences and discourse instead of memory for
the meaning of the sentences. People can, to be sure, remember sentences
and discourse verbatim. Nearly everyone can recite word for word a few
proverbs, several nursery rhymes, and any number of slogans for products
advertised on television. Actors can commit to memory the exact wording
of whole scripts. People can remember not only the words, but incidental
physical attributes of the form in which they were presented. Thus, for
example, people can recognize the presentation mode (audio or visual) of
words (e.g. Bray and Batchelder, 1972; Hintzman, Block, and Inskeep, 1972).
Memory for the gender of the reader's voice has been demonstrated with
words (e.g. Hintzman et al., 1972) and sentences (Light, Stansbury,
Rubin, and Linde, 1973). Hintzman et al. showed the subjects' ability
to discriminate between words presented in different type faces (upper
case block letters versus lowercase script) and Kolers and Ostry (1974)
found an effect of typography (normal versus inverted) of the initial
presentation of sentences upon recognition, even after a 32 day retention
interval. In the only study of this type using discourse, Rothkopf (1971)
showed that subjects were able to specify, with greater than chance
accuracy, the location of the page on which information from a passage
occurred with respect to the entire 300 word passage, and also the spatial
location of the information on the page.

Although people clearly can remember the exact words and even the
physical form of the words, it is also unquestionably true and of paramount
psychological importance that people usually remember the idea of what they
hear or read far longer and much more easily than the words that they
encountered in the process. This fact was reported by Binet and Henri (1894)
in their historic paper which has only recently been rescued from obscurity
and made available to non-French readers in an annotated partial
translation by Thieman and Brewer (Note 1). While they also reported a study
of memory for lists of unrelated words, their major interest was in memory
for brief prose passages. The study tested the memory of 510 French grade
school children, and, perhaps because of the youth of their subjects, the
passages ranged in length from only 11 to 88 words. The passages were read
to the subjects and recalled immediately.

They found, unsurprisingly, that the number of words recalled was a
positive function of age and a negatively accelerated function of passage
length. They also noted a progressive shift in error type from the
predominance of synonym substitutions for shorter passages to the predominance
of omission errors for longer passages. It is their analysis of the synonym
substitutions, however, which is of greatest interest. They hypothesized
that synonym errors marked the loss of "verbal memory," but the retention
of memory for ideas. Omission errors, on the other hand, signaled the loss
of both types of memory. Since the former type of error predominated for
shorter passages, their result shows that memory for words is lost before
memory for ideas.

They also examined the synonym errors which occurred and noted that
in the absence of verbal memory, the children tended to recall the passage
in language characteristic of French school children. Thus, in addition
to synonym errors, there was a tendency to simplify the syntax of the
passages. Both synonym and syntactic errors were seen as instances of
"l'assimilation verbologique," which is the tendency of a person to recall
in language typical of that person. Early studies by Sharp (1899) and
Henderson (1903) replicated and extended Binet and Henri's results.

One of the earliest demonstrations of the fact that people generally
remember the meaning conveyed in a verbal message and not the exact
wording of the message was provided by Buehler (1908; cited in Humphrey,
1951) using lists of unrelated sentences. Buehler saw this fact as an
illustration of the more general principle that thoughts are independent
of and do not follow the same processes as their sensory accompaniments.
The sentence list he used was a list of 20 unrelated proverbs. When a
second list of proverbs was read, some of which were similar in meaning at
the metaphorical level to the proverbs in the first list ("jokers" or
distractors were also included), the subjects were able to identify them with
great certainty. For example, subjects who heard "When the calf is stolen,
the farmer repairs the stall" in the original list would indicate it as the
appropriately related sentence when they heard "One looks to the cas[k when]
the wine escapes into the cellar" in the second list. Despite their good
performance on recognizing similar meanings, however, subjects had difficulty
when asked to recall the items, often producing changes in the wording. As
Brewer (1974) points out, the study confounded a recognition-recall
difference with its idea-wording comparison.

More recent demonstrations, however, have avoided this difficulty.
English, Welborn, and Killian (1934) compared substance and verbatim memory
by using different types of true-false recognition items. Substance memory
was tested by summary statements or paraphrased topic sentences, while
verbatim memory was tested by sentences lifted verbatim from the passage.
They found that for immediate testing or a one day retention interval,
verbatim memory was equal to or greater than substance memory. At longer
retention intervals (4 weeks or more), however, verbatim memory declined while
substance memory did not, so that, for the longer intervals, substance
memory exceeded verbatim memory. Of course, this study has several
weaknesses, primarily that it confounded the verbatim-substance manipulation
with sentence type: summary or topic sentences versus non-summary
sentences. A test-only control group might also have been used to ensure that
answering the questions required reading the passage.

Recently, Sachs (1967) demonstrated the superiority of substance
memory using a recognition test. In her experiment, subjects listened to
28 taped passages, during each of which the tape was periodically stopped
for a sentence recognition test so that the interval between the target
sentence in the passage and the related test item was 0, 80, or 160 syllables.
Recognition items were either semantically changed, syntactically changed
(so as to leave the meaning essentially unchanged), or unchanged. It was
found that with immediate testing, recognition was accurate for all types
of items. After 80 or 160 syllables, however, while subjects still
exceeded chance in their ability to detect syntactic changes, their
performance on these items was far below the level of detection for semantic
changes.

Much of the recent evidence has come from studies employing sentence
lists. Fillenbaum (1966) ran a study to test memory for the idea of a
sentence using antonym pairs. He used sentences in each of which was one
member of an antonym pair, which was either negated or non-negated. Memory
was tested using a four alternative forced choice recognition test in which
the four possible sentences were presented (2 antonyms x negated or
non-negated). It was found that when errors occurred, meaning was preserved
more often than changed, even though, for each item, 2 of the 3 distractors
changed the meaning. This result could only obtain if memory for the
meaning of the sentence remained even when memory for the words was lost.

Brewer and his colleagues have conducted a series of studies showing
the superiority of memory for ideas over memory for words for both
recognition and recall. Bock and Brewer (1974) used sentence lists containing
sentences which were drawn from pairs of sentences which were essentially
synonymous, but differed in the application of an optional syntactical
transformation. They had these sentences rated for stylistic preference
by an independent group of subjects. When recall was tested, errors tended
to preserve meaning and to shift toward the preferred form, for all six
types of transformations employed. For only three did recall errors tend
toward the untransformed sentence as a kernel plus tag theory would
predict. For recognition, stylistic preference had no effect, thus adding
stylistic preference in sentences to the growing list of variables which
affect recall and recognition differently (others are listed by Adams,
1968; Adams & Bray, 1970; and Kintsch, 1970a, 1970b). Recognition
performance was much poorer for sentences from the transformational pairs, where
the item and its distractor had the same meaning but different syntax,
than for control sentences which had semantically changed distractors.

Brewer and Shedletsky (Note 2) ran a recall study using sentences with
parallel conceptual structures (Langacker, 1973) because these have a large
number of alternative surface forms. They found that while the subjects
could recall the meaning of 41% of such sentences, they only recalled the
correct surface form for [?]% of the sentences. Brewer (Note 3) ran a
series of recall studies using sentence pairs constructed by using both
items of a highly synonymous word pair. Sentences were rated for stylistic
preference and a trend toward the preferred form was found in recall
(Exp. II). This result obtained for both abstract and concrete synonyms
(Exp. III). Thus, the study lent further support to the demonstration by
Johnson, Bransford, Nyberg, and Cleary (1972) that memory for gist is as
prominent for abstract as for concrete verbal material, and to their
analysis that Begg and Paivio's (1969) contrary result was due to a
confounding of comprehensibility with concreteness.

Another manifestation of the difference between memory for ideas and
memory for words is the fact that it is much more difficult to memorize a
passage verbatim than it is to learn the substance of the passage. Several
studies have compared learning rates for these two types of learning.
Jones and English (1926) had subjects learn a 91 word passage which they had
divided into 31 idea groups. After one reading, the subjects recalled the
substance of an average of 76% of the idea groups. Verbatim memorization,
however, required an average of 5.3 more trials, with some subjects failing
to achieve it.

Cofer (1941) used passages of 25, 50, 100, and 150 words in a study
which compared acquisition rates for substance and verbatim learning in
terms of time and trials to criterion. For both time and trials, verbatim
learning took longer than substance learning. Further, both time and
trials to criterion increased more rapidly as a function of passage length
for verbatim learning than for substance learning.

Constructive memory processes. In looking at errors which occur during
recall of prose, it is clear that some errors of commission are not simply
synonym substitutions, but rather result from the subject's inferring or
imparting information which was not present in the original passage. These
errors will be called constructive errors in this paper.

While Bartlett (1932) has certainly been the most renowned exponent of
constructive memory processes (a distinction he must now, perhaps, share
with Neisser, 1967), Binet and Henri (1894) once again provide one of the
earliest reports of the phenomenon. They noted what they called "errors
through imagination" (pp. 55-50) which occurred in the passages of more
than 20 words. These errors were said to be characterized not so much by
change in meaning as by addition to the meaning. These errors added details
to the original version. Thus, upon hearing "Thursday" a child would recall
"last Thursday" or "Thursday evening." Likewise, "one of them" became "the
youngest one", and "his parents' home" became "his home." Clearly, such
errors are examples of the type of constructive errors made famous by
Bartlett.

Bartlett's (1932) Remembering is a book which is a landmark in the
history of cognitive psychology. The current Bartlett revival has been
spurred on by the preaching of Neisser (1967) and the clever
experimentation of Bransford, Barclay, Franks, McCarrell, and all the others, and it
is long overdue. Although the book includes several different experimental
methods and materials, it is the repeated reproduction of prose material
which is relevant here. Subjects read a story twice, and repeated free
recalls were collected; the first 15 minutes after reading the story,
with later recalls being collected, as the opportunity arose, over retention
intervals which spanned days or years. Bartlett noted that for such recalls
"accuracy of reproduction, in a literal sense, is the rare exception, and
not the rule" (p. 93). Most recalls showed a "tendency to abbreviate the
story as a whole and also the details that are reported" (p. 72). Bartlett
also, however, reported another aspect of the recalls, the constructive
aspect. Some details or incidents were elaborated, with a marked dramatic
effect. Although some subjects recalled only one or two striking details,
others used the general setting and affective aspects of the story, along
with prominent details, to construct an elaborate, if inaccurate story. A
tendency by some subjects to "rationalize" the story, by filling gaps or
distorting events so as to have causal relations between events or
otherwise make sense (to the reader) was also found.

While Bartlett's reconstructive notion is currently quite popular,
his results and characterization of them have not gone unchallenged.
Gomulicki (1956) investigated the immediate recall of prose passages from
15 to 200 words. He found that although subjects were able to recall the
shorter passages verbatim, they were only able to recall the more important
aspects of the longer passages. Further, he found that omissions were by
far the most common error type, and he concluded that memory for passages
was more accurately described as abstractive than as constructive. In fact,
when judges were given both recall protocols and deliberately written
abstracts of the same passages, they were little better than chance at
distinguishing between them.

Gauld and Stephenson (1967) suspected that the changes and distortions
in recall which Bartlett found were due to deliberate inventions by the
subjects who want to "fill up gaps in their memory," a process which is
separable from memory itself, rather than being a result of constructive
memory processes. They ran several studies using the "War of the Ghosts"
which showed that if subjects were told to recall only what was in the story
and to leave gaps rather than invent if they had forgotten, this sharply
reduced the number of meaning changing intrusions or additions. If the
subjects were simply told to be accurate, the effect was the same as the
longer injunction. Of course, these results are amenable to other
interpretations. The former instructions might set up demand characteristics
(Orne, 1973) which favor the production of gaps. The latter (as well as the
former) might cause the subject to raise his subjective criterion for the
acceptable confidence level for response emission (Adams & Bray, 1970),
causing the subject to suppress information he remembers.

While earlier studies generally failed to provide any strong support
for the constructive view of memory, several recent studies have provided
striking examples of reconstructive processes in action. The first in the
line of studies by Bransford and his colleagues was Bransford, Barclay,
and Franks' (1972) famous "turtle" study which used unrelated sentences.
In it, they found that subjects who read sentences like,

Three turtles rested on a floating log, and a fish swam beneath them.

had higher algebraic recognition scores ("old," (+1) x confidence; "new,"
(-1) x confidence) for foils like,

Three turtles rested on a floating log, and a fish swam beneath it.

than they did for sentences where the spatial relationship was changed
("beside" substituted for "on" in both sentence and foil).
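
The algebraic recognition score mentioned above can be illustrated with a minimal sketch. It assumes only what the text states, namely that an "old" judgment counts as +1 times the confidence rating and a "new" judgment as -1 times the confidence rating. The function names, the 1-to-5 confidence scale, and the sample judgments are hypothetical and are not taken from the original study.

```python
from statistics import mean


def signed_score(response, confidence):
    """Return +confidence for an "old" judgment and -confidence for "new".

    The confidence scale itself (e.g. 1 to 5) is an assumption made
    for illustration; the source text does not specify it.
    """
    if response == "old":
        return confidence
    if response == "new":
        return -confidence
    raise ValueError("response must be 'old' or 'new'")


def mean_signed_score(judgments):
    """Average the signed scores over (response, confidence) pairs."""
    return mean(signed_score(r, c) for r, c in judgments)


# Hypothetical judgments on foils, invented purely for illustration.
inference_foils = [("old", 4), ("old", 3), ("new", 1)]
changed_relation_foils = [("new", 4), ("old", 1), ("new", 3)]

print(mean_signed_score(inference_foils))
print(mean_signed_score(changed_relation_foils))
```

A positive mean indicates that subjects tended to accept the foils as previously seen, and a negative mean that they tended to reject them, so a higher score for inference-consistent foils is the pattern the passage describes.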

Since that time the Bransford group has conducted several studies
using brief prose passages which make the same point. A study which
Sue Solomon ran (cf. Bransford & McCarrell, 1974) extended the Bransford,
Barclay, and Franks result to brief prose passages. Subjects heard a
brief passage describing the relative locations of a pond, forest, etc. in
relation to a farmyard. Before hearing the passage, they saw a picture of
a farmhouse and a hill. Depending on the condition the subject was in, he
saw a picture with the farmhouse on the hill or beside the hill. Subjects
who saw the former picture were very likely to falsely recognize a
statement which gave the location of something relative to the hill which had been
stated relative to the farmhouse (or vice versa), while subjects who saw
the latter picture were not.

Johnson, Bransford, and Solomon (1973) presented subjects with brief
passages such as: "John was trying to fix the bird house. He was pounding
the nail when his father came out to watch him and to help him do the work."
Or "It was late at night when the phone rang and a voice gave a frantic
cry. The spy threw the secret document into the fire just in time, since
30 seconds longer would have been too late." They found that subjects who
heard such passages were very likely to falsely recognize statements which
included an unstated instrument (e.g. "John was using the hammer to fix the
birdhouse. . . .") or consequence ("The spy burned the document just in
time. . . ."). Subjects who heard control passages where minimal word
changes altered the meaning (e.g. "looking for the nail" instead of
"pounding the nail"; "pulled the secret document from the fire" instead of
"threw the secret document into the fire") were much less likely to make
such errors.

Brewer (Note 4) has initiated a series of studies on memory for the
pragmatic implications of sentences. Pragmatic implications differ from
logical implications in that while logical implications are necessarily
implied under the logic of some system, pragmatic implications are not
dictated by any formal system. Pragmatic implications, rather, express
expectations derived from the subject's knowledge of the world. Pragmatic
implications were tested for by application of the "but-not" test derived from
Lakoff's (1971) analysis of the use of "but" as a denial of expectation.
This test excluded instances which had logical implications or lacked an
implicational relationship. Brewer found that pragmatic recalls constituted
an average of 20% to 30% of the total recalls across 46 items and were as
high as 80% to 88% for some items. Examples of the most common implicational
errors are:

The hungry python caught the mouse.

tended to be recalled as

The hungry python ate the mouse.

The safe cracker put the match to the fuse.

tended to be recalled as

The safe cracker lit the fuse.

Other examples of memory for pragmatic implication are provided by
Harris (1974) and by Brewer and Lichtenstein (1974). Harris found that
sentences like

Miss America said that she played the tuba.

are often recalled as

Miss America played the tuba.

Brewer and Lichtenstein found that for continuous antonym pairs, where
negating one does not logically imply the other (unlike dichotomous
antonyms, where such implication holds), subjects often recall the pragmatic
implication. Thus, subjects who heard a sentence stating that something is
"not hot" were apt to recall a pragmatically implied sentence stating it
was "cold."



A recent wave of studies has manipulated the operation of constructive
processes in discourse memory. Frederiksen (1972, 1975b) affected the
occurrence of constructive errors in discourse recall by instructing subjects
to remember a passage or to try to solve sociological problems posed by it.
Spiro (Note 5) has demonstrated dramatic effects of telling subjects that
they were in a memory experiment, or that the experiment was concerned with
their reaction to interpersonal relations described in the passage. Dooling
(Note 6; Sulin and Dooling, 1974) has shown that at long retention intervals
constructive errors occur for the recall of a passage about a famous person
if the passage is given the appropriate title (Adolf Hitler) but not if it
is given a fictitious main character (Gerald Martin).

Differences Between Sentence Lists and Connected Discourse

As we have seen, it has been demonstrated with both sentence lists and
connected discourse that what is remembered is usually not the exact wording
of the material, but rather, its meaning. Further, memory often goes beyond
the mere reproduction of meaning and exhibits constructive properties which
result in material being recalled or recognized which was not present in
the original message. Against this backdrop of important similarities we
will now consider the question of whether or not important differences
exist between sentence lists and discourse. First we will consider the
studies which have directly compared the two materials, and then we will
examine the question from a broader theoretical perspective.

Direct Experimental Comparisons

In this section we will examine studies which have directly compared
sentence lists with connected discourse. Unfortunately, these studies have,
for the most part, been atheoretical or primarily directed at other issues,
so that the comparison usually does not go beyond a simple test for
differences in amount recalled. There are, however, a few notable exceptions;
and, hopefully, useful suggestions for better motivated comparisons will be
offered later in the paper.

A number of studies (e.g., Dooling & Lachman, 1971; Lachman & Dooling,
1968; Montague & Carter, 1973; Pompi & Lachman, 1967; Yuille & Paivio, 1969)
have manipulated discourse structure by presenting the words in either
normal or scrambled order. The studies to be reviewed here, however, will
only be those which compare sentences in randomized lists to connected
discourse. Such experiments are rather rare in the experimental psychology
literature.

Qmer (1973) [...] report of two studies [...] latencies, one using connected
discourse and the other [...] sentences in randomized lists. A more recent
and complete report of these studies is also available (Anderson, 1974). The
primary purpose of the studies was not to compare the two types of
organization; the study was intended to test two rival hypotheses about the
storage of sentence form (active or passive) information. It was found that
for both types of materials, with immediate verification, verification was
faster when the voice of the target sentence and the probe matched. When
verification was delayed (2 min.), the effect only reached significance for
the scrambled sentence condition, although the same trend occurred for the
"story" condition.

Brent (1969) conducted a serial anticipation study where the materials
used were organized at four different levels: isolated words, anomalous
sentences, natural sentences, and paragraphs. List length was also varied
(22, 34, 35, or 38 words; 7 or 10 sentences). It was found (Exp. I) that
errors (verbatim scoring) were inversely related to the level of
organization. Further, although list length was a significant factor at all
levels of organization, the two factors had a significant interaction
resulting from the decreasing potency of the list length effect at higher
levels of organization.

Just as Brent's study grew out of earlier serial anticipation studies
(i.e., Brent, 1965, 1966, 1967), so Levin (1970) was led to comparing
unrelated sentences to discourse by studies of the effect of embedding words
in sentences in serial anticipation or paired-associate word learning tasks.
Levin noted that the studies in which sentences had a facilitative effect
(e.g., Clark & Bower, 1969; Levin & Posner, 1968) used a single sentence or
related sentences, while those that failed to find a facilitative effect
(e.g., Jensen & Rower, 1963, 1965; Levin & Rower, 1968) used unrelated
sentences or phrases. Levin argued that the difference between the two sets
of results obtained because the sentence or sentences served to unify the
list in the former case, but not in the latter. He systematically varied the
number of unrelated sentences in which words were embedded in order to test
this hypothesis. Using the same 12 word list, subjects studied the words
in either a single, long, connected sentence (a short story) or in 2, 3, 4,
6, or 12 unrelated sentences (incorporating 6, 4, 3, 2, or 1 items per
sentence, respectively). For both anticipation and serial learning, with
fourth and fifth grade subjects, it was found that total trials to criterion
were greater when 3, 4, 6, or 12 sentences were used than when 1 or 2
sentences were used, as Levin had predicted.

In the educational psychology literature, there are a number of studies
which have followed the line of investigation begun by Frase (1969a), in
comparing name, attribute, and random organization of passages. Frase used
a passage which described the attributes (moves, captures, values, etc.) of
chessmen; for each of 6 chessmen, 8 attributes were described, always in a
separate sentence, yielding 48 sentences. The sentences were organized by
name (all 8 sentences describing a given chessman grouped together), by
attribute (the 6 sentences describing a given attribute for all 6 chessmen
grouped together), or randomly. Subjects received three study-free recall
trials followed by a multiple choice test. For the recall tests, both
attribute and name organization resulted in better performance than did
random organization. The same trend appeared in the multiple choice test,
but did not reach significance.
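The three passage organizations can be made concrete with a short sketch.
The Python below is illustrative only and is not Frase's procedure: the
function name build_passage, the seed parameter, and the placeholder labels
attribute_4 through attribute_8 are assumptions (the text names only moves,
captures, and values), and each sentence is represented simply as a
(name, attribute) pair.

```python
import random
from itertools import product

# Six chessmen; only the first three attribute labels come from the text,
# the remaining five are hypothetical placeholders.
NAMES = ["king", "queen", "rook", "bishop", "knight", "pawn"]
ATTRIBUTES = ["moves", "captures", "value"] + [f"attribute_{i}" for i in range(4, 9)]


def build_passage(names, attributes, organization, seed=0):
    """Return one (name, attribute) slot per sentence, in presentation order."""
    cells = list(product(names, attributes))  # grouped by name by construction
    if organization == "name":
        return cells
    if organization == "attribute":
        # Stable sort: within each attribute the names keep their original order.
        return sorted(cells, key=lambda cell: attributes.index(cell[1]))
    if organization == "random":
        shuffled = cells[:]
        random.Random(seed).shuffle(shuffled)
        return shuffled
    raise ValueError(f"unknown organization: {organization!r}")


by_name = build_passage(NAMES, ATTRIBUTES, "name")
by_attribute = build_passage(NAMES, ATTRIBUTES, "attribute")
by_chance = build_passage(NAMES, ATTRIBUTES, "random")
```

All three orders contain the same 48 sentences, so any difference in recall
between conditions is attributable to sentence order alone.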

Of the studies which have followed in this line of research, not all
have included a random order condition. Some have simply compared name and
attribute organization (e.g., Frase, 1973a, 1973b; Friedman & Greitzer, 1972).
Still, several have included random organization. Schultz & DiVesta (1972)
used name, attribute, and random organization of a passage of 36 sentences
in which six imaginary countries were given six attributes each. Once again,
three study-free recall trials were employed, and once again name and
attribute organization both produced better recall than did random ordering.
Myers, Pezdek, and Coulson (1973) used a 25 sentence passage of five
fictitious countries with five attributes each. In the two experiments which
included random organization, it was found to produce poorer recall than
either name or attribute organization across three study-recall trials
(Exp. I) and for both free and serial recall (Exp. II). Perlmutter and Royer
(1973) replicated the effect in a study in which they used the same material
as Myers et al., and the three study-test trials, but required subjects to
recall the items by name, or by attribute, or by free recall.

In contrast to the studies reviewed above, two studies (Bruning, 1970;
Carter, Note 7) have failed to find any difference between connected discourse
and sentence lists. Bruning used 25 paragraphs of four sentences each on a
fictitious African tribe. The study included three types of organization:
regular prose, scrambled, and unrelated sentences in random order. The first
two conditions used the same sentences, but differed in their order. The
third condition had unrelated sentences quasi-randomly selected from an
almanac, and randomly ordered. All three, however, had the same five target
sentences in the same positions in the passage. Only the target sentences
were tested, using a "fill in" test in which the sentence was provided, with
a blank replacing the one word or number which was the correct response.
While both prose and random organizations exceeded the irrelevant sentence
condition, they did not differ significantly between themselves.

Bruning's failure to obtain a significant effect of sentence order may
have been due to the method of testing employed in his study. The one word
completion task he used is quite different from the free recall tests
typically employed by the investigators cited above. Kissler and Lloyd (1972)
investigated this possibility in a study which employed both a completion
and a short answer test. They used eight paragraphs on math models for
business, each of which had a superordinate sentence and four subordinate
sentences. Half of the paragraphs were strongly related and the other half
were weakly related, as determined by the coefficient of concordance for
independent groups of subjects who ordered the sentences within the
paragraphs. Three study-test trials were given, with short answer essay
questions "that required the subject to recall information from at least two
sentences in the paragraph" followed by 16 one word completion items
constituting the test phase. The eight paragraphs were presented in booklets
in either normal or scrambled order, with sentence order being a
between-subject factor. In the scrambled condition, the superordinate
sentence was first in each paragraph, and the other sentences were randomly
ordered within the paragraph. For the completion test, only the trials effect
was significant. Thus, Bruning's failure to obtain a significant order effect
using a completion test was replicated. For the short answer test, however,
the order effect was highly significant, as was the trials effect. A
surprising result was that the normal order was especially better than
scrambled order for the unrelated paragraphs, as evidenced by a significant
order x relatedness interaction. Order also interacted with trials, as the
order effect was stronger for trials 2 and 3 than for the first trial.

Carter (1972) presented a 1,500 word passage on "Himoots," a fictitious
South American tribe, in normal or scrambled sentence order. The
subjects were instructed either simply to read the passage, to read it and
subjectively organize it (using notes, underlining, etc.), or to memorize
a supplied outline of the material and relate the passage to it. The subjects
were tested with a free recall test followed by a "cued recall" test which
consisted of fill-in, short-answer, and true-false questions. The passage
was divided into 210 idea units for the purpose of scoring the free recalls.
Substance scoring was employed. Free recall and "cued recall" results were
analyzed together in a single analysis of variance which included a cueing
factor. No effect of sentence order or of instructional set was detected.
While the cueing effect reached significance, it did not interact with either
order or instructions.

These results were not what Carter had anticipated. He had predicted a
significant order effect and an order x cueing interaction due to a more
pronounced order effect for free recall than for "cued recall." Carter and
Carrier (Note 7) have pursued the matter further, and met with somewhat
better results. They report two studies which employed a 1,067 word version
of the Himoot passage which had four hierarchical levels (A = Himoots;
B = Religion, Economic System, Appearance; C = 3 subtopics for each B-level
topic; D = 2 subtopics of four sentences each for each C-level subtopic). In
their first experiment, sentence order (normal or random) was orthogonally
combined with superordinate sentence (levels A & B) placement (beginning or
middle of the passage). The passage was presented in booklets with one
sentence per page, and three successive repetitions of each sentence, or
three repetitions of the entire passage. A free recall test was administered
after a four minute filled retention interval and scored using substance
scoring for 210 idea units. The result of primary interest for our purposes
is that, while neither the sentence order nor superordinate placement effects
reached significance, they did interact significantly. The interaction
obtained because normal sentence order was better than scrambled order for
the versions with superordinate sentences at the beginning. The main effect
of order was significant for a clustering measure, as it had been in Carter's
(1972) original study.

Carter and Carrier's second experiment used the same material, except
that superordinate placement was not varied and the booklet pages contained
nine or ten sentences. The passage was read one or three times. Both free
recall and cued recall (C level cues) were tested, and the results were
analyzed together. This time the sentence order effect was significant, but
interactions with exposures and cueing revealed that the sentence order
effect obtained with three exposures but not one, and with cued recall, not
with uncued. A completion test was also given, but the only significant
effect on this test was exposures.

A study by Lee (1965) employed a closely related order manipulation.
He manipulated "supra-paragraph structure" by presenting the paragraphs of a
1,600 or 2,100 word passage on naval discipline in random order, in
normal order, or in normal order with added initial and final summary
paragraphs, transitional paragraphs, and main headings. In all three
conditions, the order of sentences within paragraphs was normal. In two
experiments, subjects were tested with short essay questions worded and
scored with emphasis on abstracting the main ideas presented in the passage,
other questions designed to see how much detail could be recalled, and a
two-alternative recognition test with targets which were sentences lifted
verbatim from the passage, and distractors which were paraphrases of the
targets. In both studies the effect of organization was significant only for
the test of main ideas, and then the effect was entirely due to the high
level of structure, which had supplemental material totaling 380 words in the
first experiment and 510 words in the second. No difference between normal
and randomized paragraph order was detected in either study. This is perhaps
not too surprising, since the passages were selected on the basis of minimal
interparagraph dependencies, so that the paragraphs would make sense in
any order. Indeed, in the second study, when subjects were asked to rate
how well the passage was organized or structured, the normal and scrambled
versions did not differ significantly, although both were lower than the
high structure version with the additional material.

A somewhat related field of the educational psychology literature is
the effect of frame order or sequence in programmed instruction. Both
detrimental effects of scrambling (e.g., Brown, 19W; Buckland, 1968) and no
difference (e.g., Hamilton, 1964; Payne, Krathwohl, & Gordon, 1967) between
scrambled and logical orders have been found on criterion test performance.
Other studies have found no difference on criterion test performance, but
noted that "logically" ordered programs produced fewer errors and required
less time for completion (e.g., Niedermeyer, Brown, & Sulzen, 1969). This
paper, however, will not attempt to cover the programmed instruction
literature.

Theoretical Comparisons

From the brief review of experimental comparisons above, it is easy to
see that connected discourse is usually easier to learn or remember than
scrambled or unrelated lists of sentences. The lack of theoretical motivation
in most of the studies, however, makes premature any conclusion that
performance level on memory tasks constitutes the only difference. Even if
that conclusion could be justified, it would still be desirable to explain
the source of the difference: that is, it would still remain to determine
what aspect or property of discourse, or subjects' processing of it, makes it
more memorable (or easier to comprehend or learn) than sentences in lists.
It seems these questions have received little serious attention.

The purpose of the next sections of this paper is to consider these
questions more closely in the light of recent developments in experimental
psychology. The comparisons to be made divide naturally into two types:
differences between the materials themselves, or stimulus differences, and
differences between the ways subjects process the materials.

Stimulus differences. One of the major difficulties which has crippled
the development of the discourse learning literature is the failure to make
a clear distinction between the form and the content of the discourse. As a
stimulus, discourse may be described at several levels of abstraction, and
it is important not to confuse these levels. As a first approximation, three
levels of description are sufficient: physical form, linguistic form, and
content. Physical form is the level which describes the mode of presentation
of the discourse, auditory or visual, and such physical characteristics
of the stimulus as type face or location on a page for visual presentation
and gender of the speaker or background noise for auditory presentation.
Linguistic form is the level which describes the syntax of the sentences and
the lexical form of the words used in the discourse. Finally, the content
level describes the meaning of the discourse, and includes the representation
of the informational content of the discourse, as, for example, in
propositions, and the structure or interrelations of that content. Early
accounts of discourse structure tended to focus on the form of the discourse.
Thus, readability indices (Chall, 1958) were primarily based on word length
(lexical form) and sentence length (syntactical form). More recent accounts
of discourse structure have tended to focus on the content level (e.g.,
Crothers, 1972; Fredericksen, 1972, 1975a, 1975b; Kintsch, 1972, 1974).
Trouble often results when these levels are confused. Idea units (Henderson,
1903) were developed in order to overcome some of the limitations of
verbatim word counts (which are heavily dependent upon linguistic form) as
a measure of content recall. However, when verbatim scoring is applied to
the idea units, their original purpose is defeated and anomalous results may
be obtained (e.g., King & Russell, 1966).

Form. For the purpose of comparing sentence lists to discourse, all
variables of form, except sentence order, may be factored out by using the
same sentences in the two types of material. Clearly, sentence order is
not merely a form variable, but often affects content as well. Thus, while
different sentence orders may sometimes convey essentially the same meaning
or content (e.g., Crothers, 1972; Frase, 1969b), a random order will usually
destroy much of the content of the discourse by transforming it into a
sentence list. Further, in discourse, sentence order is often used to convey
information such as a causal or temporal relationship. This can be
illustrated by an example where clause order varies within a sentence: "John
fainted and fell off his horse" means something quite different from "John
fell off his horse and fainted" (adapted from Coleman, p. 277; Conference
Discussion of Crothers, 1972). Likewise, "Tom's wife divorced him. He
started drinking" implies a different temporal or causal relationship than
"Tom started drinking. His wife divorced him." In sentence lists, since
the sentences are unrelated, sentence order carries no information and is
arbitrary and usually randomized.

Content. It is the relatedness of sentences in discourse, the fact
that sentences in discourse are related to the theme or topic of the
discourse, as well as to each other, that, to the first approximation,
distinguishes discourse content from the individual meaning of all the
sentences in a sentence list. Another way of expressing the difference
between sentence lists and discourse with regard to theme relatedness is that
connected discourse has a topic or theme, while unrelated sentence lists do
not. The notion of the topic or theme of discourse is not new in psychology.
Binet and Henri (1894) invoked a similar concept when comparing the learning
of words in lists to words in discourse:

     The differences between memory for words and memory for connected
     discourse is suggested to be due to two causes. First, when one
     recites either, one calls to mind a series of images; however,
     with isolated words, the images are disorganized, or they don't
     interrelate naturally, and the child hears them without
     searching for a common association. With passages, however,
     the images are whole, one connected to the next, and perfectly
     organized. Second, when encountering isolated words, images
     are rapidly formed and each new one pushes the old one from
     consciousness, affecting its eventual strength. (p. 31, quoted
     in Thieman and Brewer, 1975)
The same analysis could be applied to unrelated sentences in lists; unless
the subject can invent a scenario linking them, they must be learned as a
disconnected list of unrelated items.

Bartlett (1932) stressed the importance of the theme in connected
discourse, noting that,

     The form, plan, type, or scheme of a story seems in fact, for
     the ordinary, educated adult, to be the most dominant and
     persistent factor in the material (p. 83).

More recently, Pompi and Lachman (1967) echoed Bartlett almost exactly when
they characterized the comprehension of connected discourse as the
construction of "surrogate structures," which they said were "some
combination of theme, image, scheme, abstract, or summary" (p. 143).

Levin (1970) stressed the unification provided by a common theme in
order to explain the finding that for serial recall, anticipation, and paired
associate studies in which words were embedded in sentences, only those
which employed a single sentence or related sentences demonstrated
facilitation (e.g., Clark & Bower, 1969; Levin & Rower, 1968).

When the passage is vague enough, as in studies by Dooling and Lachman
(1971), Bransford and Johnson (1972), and Dooling and Mullet (1973), then
the subjects will be unable to benefit from the theme relatedness of the
passage, unless they are cued onto the theme with a title or picture which
disambiguates the material. The theme may also be obscured by the use of
indefinite articles, as in a recent study by deVilliers (1974). Brown
(1973) pointed out that indefinite articles are used to introduce new
referents into discourse, while a definite article is used to refer to a
referent which has already been introduced. deVilliers manipulated the
use of articles in order to make it more or less likely for subjects to
treat a loosely knit passage about the adventures of a boy as a story rather
than as a set of unrelated sentences. When definite articles were used, the
subjects assumed that referents were shared across sentences, and they were
more likely to treat it as a story. With indefinite articles, subjects
assumed that referents varied across sentences, and thus were more likely to
treat the passage as a list of unrelated sentences. The manipulation was
not entirely successful, but deVilliers used post-experimental questioning
to determine how subjects had treated the passage and analyzed his data with
subjects sorted on their answer. The subjects who treated the passage as
a story recalled more than did those who treated it as a sentence list.

Clearly, then, a unifying theme is a key difference between connected
discourse and sentence lists. The effect of the theme is to unify the
passage and to provide a semantic context which guides and facilitates the
comprehension, learning, and memory of sentences in the discourse. Pezdek
and Royer (Note 8) conducted a pair of studies which investigated the effects
of providing context on the recognition of meaning and wording changes for
sentences. A list of concrete and abstract target sentences was presented
either in isolation or in a paragraph context, with two context sentences
preceding the target sentence. Pezdek and Royer were primarily interested
in the effect of providing context on the detection of meaning changes in
abstract sentences, and so their reported analyses focused on this predicted
effect and the second order interaction (sentence type x context condition x
test type) in which it was embedded. The predicted effect and interaction
were significant in both experiments, although only marginally so in the
first experiment. Of more general interest, however, is the first-order
interaction of context and type of test. This interaction failed to reach
significance in the first experiment, and was not reported in the second
experiment; but an inspection of the graphed results (Figure 2) clearly shows
a strong interaction: providing context aids the detection of meaning changes
(subject-object reversal), but is actually detrimental to the detection of
wording changes. Pezdek and Royer explain their results as showing that
context increases the comprehension of a sentence and hence the probability
that it will be stored primarily as a semantic interpretation of its meaning,
rather than in its surface form as a list of words. Unfortunately, their
study suffers from a design weakness which makes an alternative interpretation
of the results possible. The increased detection of meaning changes (and
also the decreased sensitivity to wording changes) in the context condition
may have been due to information provided by the context sentences, rather
than to any effect on the comprehension of the target sentence. A control
group which studied the context sentences only, and then was given the
recognition test, would be needed to evaluate this alternative source of the
effect. Another solution to the problem would be to compare the context
condition to the same context and target sentences presented as a randomized
list.

The limitation of relatedness as an explanatory concept for discourse
is that it says nothing about how the sentences or ideas and events expressed
in them are related. The recent discourse structure models have been
developed in order to begin to specify and analyze how discourse content is
organized and structured. They represent attempts to take the necessary
next step beyond the simple, formless concept of relatedness.

Dawes (1964, 1966) analyzed discourse structure in terms of set
theoretical relations of identity, exclusion, inclusion, and disjunction. The
analysis was used to study the direction of errors in recall and recognition,
toward disjunction ("pseudodiscrimination"), or away from it
("overgeneralization"). Dawes' analysis, however, was specific to the
artificially simple material constructed for the studies, and does not appear
to have the power to be applicable to more complex forms of discourse.
Frase's (1969b) structural analysis is subject to the same criticisms.

A much more elaborate method of analysis was applied by Crothers (1972),
who represented semantic content and structure equivalently as a directed
graph or logical predicates. Although the analysis was fairly rigorous and
quite complex, it proved to be seriously off the mark, since by Crothers'
own interpretation (pp. 274, 275), it failed to demonstrate even a trend for
the overall theme to be recalled better than its details (superordinates
versus subordinates in the representation) or for a major theme to be better
recalled than a minor one (primary versus secondary subtrees). The failure
to find better recall for the more important parts of discourse is an
especially telling deficiency, because this is one of the most long standing
and reliable facts of prose recall (e.g., Bartlett, 1932; Binet & Henri,
1894; Gomulicki, 1956; Johnson, 1970).

Meyer & McConkie (1973) used quite a simple method of discourse
structure analysis. They had graduate students outline a passage, and then
converted the outlines to tree structures. From these tree structures,
three measures of the importance of an idea unit in the structure of the
passage were developed: a hierarchy depth score, which measured how high in
the hierarchy the unit occurred; a units beneath score, which measured the
number of units which were beneath the given unit in the hierarchy; and a
combined hierarchy score, which combined the two above measures, equally
weighted, into a single, unified measure. Significant effects upon recall
were found for all three measures. Further, when significant effects of
serial position and rated importance were found, these turned out to be
largely due to the correlation of these factors with hierarchical importance.
They also found that, if a unit was recalled, then there was nearly a 70%
chance that the unit which occurred immediately above it in the tree was
also recalled, although, overall, recall was only about 23%. Further,
combined hierarchy score was positively correlated with stability of recall
across two recall trials.
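The three measures can be sketched on a toy outline. The Python below is
only an illustration under stated assumptions and is not Meyer and
McConkie's actual scoring procedure: the outline TREE and its unit names are
invented, the root is counted as level 1, and the "equally weighted"
combination is assumed here to be the mean of two standardized (z) scores,
with depth negated so that a larger value always means a more important unit.

```python
from statistics import mean, pstdev

# Hypothetical outline: each idea unit maps to the units directly beneath it.
TREE = {
    "theme": ["topic_a", "topic_b"],
    "topic_a": ["detail_a1", "detail_a2"],
    "topic_b": ["detail_b1"],
    "detail_a1": [],
    "detail_a2": [],
    "detail_b1": [],
}


def depth_scores(tree, root):
    """Level of each unit in the hierarchy; the root is level 1 (highest)."""
    depths = {root: 1}
    stack = [root]
    while stack:
        unit = stack.pop()
        for child in tree[unit]:
            depths[child] = depths[unit] + 1
            stack.append(child)
    return depths


def units_beneath(tree, unit):
    """Number of units anywhere below the given unit."""
    return sum(1 + units_beneath(tree, child) for child in tree[unit])


def standardize(values):
    """Convert raw scores to z scores (assumes the scores are not all equal)."""
    m = mean(values.values())
    s = pstdev(values.values())
    return {unit: (v - m) / s for unit, v in values.items()}


def combined_scores(tree, root):
    """Assumed combination: equal-weight mean of the two standardized measures."""
    height = standardize({u: -d for u, d in depth_scores(tree, root).items()})
    beneath = standardize({u: units_beneath(tree, u) for u in tree})
    return {u: (height[u] + beneath[u]) / 2 for u in tree}


scores = combined_scores(TREE, "theme")
```

In this toy outline the theme receives the highest combined score, and
topic_a outranks topic_b because it has more units beneath it at the same
depth.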

While Meyer and McConkie's (1973) analysis has been successful, it is
intuitive and informal. It also seems to be more useful for comparing the
importance of units within a given structure than for comparing different
structures. The analysis currently being developed by Kintsch should avoid
such criticisms. Kintsch (1972) developed his propositional description
of semantic content and organization as a proposal for semantic or lexical
memory structure. Since that time, however, he has used it to describe the
semantic content and structure of discourse material itself. He has not
used his system to analyze discourse, but instead starts with the
propositional description or text base and derives prose from it. In this
fashion, he has been able to keep propositional content fixed while varying
syntactic complexity in order to study the effect of the latter on "reading"
and "inference" times (Kintsch & Monk, 1972). He has also been able to
systematically vary the number of propositions in discourse and study the
effect of this factor on reading time and recall (Kintsch & Keenan, 1973).
Most recently he varied the number of propositional arguments (Kintsch, 1975).

Although still in the early stages of development, Kintsch's system
has already yielded promising results. Besides observing the expected
effects of syntactic complexity, number of propositions, and number of
arguments, it also replicated the results of Meyer and McConkie (1973), using
Kintsch's more formal and objective propositional description. Kintsch's
(Kintsch & Keenan, 1973) propositional rank is essentially equivalent to
Meyer and McConkie's hierarchy depth score, and Kintsch's counting of
descendant propositions is analogous to Meyer and McConkie's units beneath
score. Kintsch's system seems to hold the most hope for future development
of a formal system of description for discourse structure and content.

Propositional representation of discourse content also offers a promising
basis for solving the vexing problem of scoring discourse recall.
The methods normally used, verbatim or substance scoring of words, idea
units, or sentences, are all too surface bound, that is, too closely tied
to the linguistic form of the discourse, to be completely satisfactory for
measuring content recall. Propositional scoring would be similar in intent
to idea units, but would lack the notorious arbitrariness of idea units
(e.g., Levitt, 1956). Propositional representation is theoretically
motivated and is capable both of objectivity and also of validation.
Kintsch's work is as much an attempt to experimentally validate the
representation of prose content provided by his text bases as it is an
attempt to investigate the comprehension and memory of prose. By
demonstrating that measures derived using his system of representation exert
systematic control over discourse learning and memory, Kintsch validates his
system of representation both as a representation of text content, and as a
basis for scoring discourse recall.

Before a particular prepositional representation can be sufficiently 
validated to be generally accepted, however, problems general to any such 
system must be solved. First of all, there is the problem of how to specify 
what is acceptable as a realization of a proposition In recall. Second, 
there is the problem, with propositions with multiple arguments, of how to 
score partial recall. Finally, there is the problem that not all proposi- 
tions-are equally important. For example, the sentence 

     The red ball broke the window

is represented in Kintsch's system as

     1. (Break, Ball, Window)

     2. (Red, Ball)

It is clear that, for most purposes, the first proposition would be more
important than the second. Thus, for most purposes of investigating prose
learning or recall, recall will be measured best not by a simple proposition
count, but rather by a weighted total, where each proposition's weight is
determined by its importance. Importance can be determined by the
representational system itself, as in the work of Meyer and McConkie (1972)
or Kintsch (1974; Kintsch & Keenan, 1973), or it may be assigned by the
investigator, if he knows what he wants the subjects to remember, as, for
example, in educational applications.
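
The weighted total described above can be made concrete with a minimal
sketch. It assumes that each proposition is written as a tuple and that the
investigator has already assigned the weights; the values 2.0 and 1.0 and
the function name weighted_recall are illustrative assumptions, not part of
Kintsch's or Meyer and McConkie's systems.

```python
from typing import Dict, Set, Tuple

Proposition = Tuple[str, ...]


def weighted_recall(weights: Dict[Proposition, float],
                    recalled: Set[Proposition]) -> float:
    """Return the weighted proportion of text-base propositions recalled."""
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    # Credit a proposition only if it appears in the scored recall protocol.
    earned = sum(w for prop, w in weights.items() if prop in recalled)
    return earned / total


# Text base for "The red ball broke the window".
# The weights are hypothetical: the main proposition counts twice as much.
text_base = {
    ("Break", "Ball", "Window"): 2.0,
    ("Red", "Ball"): 1.0,
}

score_main_only = weighted_recall(text_base, {("Break", "Ball", "Window")})
score_modifier_only = weighted_recall(text_base, {("Red", "Ball")})
```

A simple proposition count would score both protocols identically (one of
two propositions), whereas the weighted total credits recall of the main
proposition more heavily. The sketch leaves open the other two problems
named above: what counts as a realization of a proposition, and how to
score partial recall of a proposition with several arguments.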
One major advantage of a clear distinction between form and content
variables is that it leads to the concern that effects that are attributed
to content variables should not be due to the confounding of form variables.
Two obvious possible confoundings which might account for the effect of
hierarchical importance on recall are serial position and the particular
material involved.

Serial position is often confounded with hierarchical importance because
structurally more important propositions tend to occur at the beginning of
discourse. Thus, any observed structural importance effect might be due to
a confounded serial position primacy effect. Both Meyer and McConkie
(1973) and Kintsch (1974; Kintsch et al., 1975) have tested this possibility
and have found that the structural importance effect remains when serial
position is controlled or factored out.

While the confounding of serial position and hierarchical importance
has concerned investigators of text recall, the confounding with materials
used has not. Since the sentences of high structural importance are
different from those of low importance, the possibility exists that the
observed importance effects are in fact due to the greater learnability,
memorability, or familiarity of the propositional content or sentence form
of the high importance material. Clearly this possibility cannot be
dismissed out of hand, and studies which test it are urgently needed.
Perhaps the simplest test would be to present the same sentences in normal
discourse and in randomized lists. If the structural importance effect were
indeed due to the role of the material in the content structure of the
discourse, then it should be observed only in the discourse condition and
not in the list condition. If the effect were due to the confounding of the
material involved, then it would appear in both conditions.
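
The logic of this test can be sketched as a comparison of the importance
effect (high minus low importance recall) across the two presentation
conditions. The recall proportions below are invented for illustration, and
the fixed criterion of 0.05 is only a stand-in for a proper significance
test of the importance x presentation interaction.

```python
from statistics import mean
from typing import Dict, List

Recall = Dict[str, List[float]]


def importance_effect(recall: Recall) -> float:
    """Mean recall of high-importance minus low-importance sentences."""
    return mean(recall["high"]) - mean(recall["low"])


def interpret(discourse: Recall, lists: Recall, criterion: float = 0.05) -> str:
    """Classify the outcome pattern; criterion is a placeholder, not a test."""
    in_discourse = importance_effect(discourse) > criterion
    in_lists = importance_effect(lists) > criterion
    if in_discourse and in_lists:
        return "material confound"   # effect follows the sentences themselves
    if in_discourse:
        return "structural role"     # effect depends on discourse structure
    return "neither account"         # pattern predicted by neither hypothesis


# Hypothetical recall proportions for the same sentences in both conditions.
discourse_condition = {"high": [0.8, 0.7, 0.9], "low": [0.5, 0.4, 0.6]}
list_condition = {"high": [0.5, 0.4, 0.45], "low": [0.45, 0.5, 0.4]}

outcome = interpret(discourse_condition, list_condition)
```

With these invented values the importance effect appears only in the
discourse condition, the pattern expected if structural role rather than
the material itself is responsible. An effect present in both conditions
but larger in discourse would suggest that both factors contribute, a case
this sketch does not separate.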

The development of models of prose content has paralleled the development
of network models of semantic memory. Indeed, Kintsch's system originated
as a memory model and can be represented as a network, although more
recently it is employed as a method of representing discourse content or
text bases. Although much of the semantic memory work has gone into theory
and research about the types of memory structures and processes needed in
order to account for latencies in verifying statements like "canaries are
yellow" (e.g., Collins & Quillian, 1969; Rips, Shoben, & Smith, 1973), there
is evidence that semantic memory models may be moving towards applicability
to prose learning and memory. Anderson and Bower's (1973) model HAM, which
permits the embedding of proposition (context and fact) tree structures into
higher order propositions, seems capable of being applied to discourse.
Indeed, HAM might be able to explain deVilliers' (1974) result rather nicely
by postulating that new nodes were constructed when indefinite articles were
used, while links to existing nodes were constructed for definite articles,
producing isolated propositions in the former case and interconnected
propositions in the latter case. The structural model sketched by Rumelhart,
Lindsay, and Norman (1972) and Rumelhart and Norman (Note 9), which is
based on events and episodes, also seems capable of dealing with discourse,
although the lack of specified processes makes any application speculative.
Rumelhart (Note 10) has confronted discourse directly by developing a text
grammar for children's stories. Linguists as well have become concerned
with the development of text grammars (e.g., Petöfi & Rieser, 1973),
and Charniak (Note 11, Note 12) has explored the knowledge an artificial
intelligence program would need to "comprehend" children's stories.
Discourse structure will doubtless continue to receive an increasing share
of attention in psychology, linguistics, and artificial intelligence.

Subject processing. As important as the stimulus differences between
sentence lists and connected discourse are, confining an analysis of
differences between the two types of material to stimulus differences would
clearly leave the analysis incomplete and inadequate. Differences between
how subjects process sentence lists and discourse are an equally important
consideration. No truly explicit, comprehensive, and reasonable theory of
how subjects process discourse is currently available, and none will be
offered here. In view of the rather undeveloped state of the field, this
is not surprising. Rather than reviewing existing process models of
discourse comprehension, learning, and memory, such as they are, this paper
will attempt to take current theoretical approaches which have broad
applicability and have had considerable impact and apply them to the
comparison of sentences in lists and in discourse. The approaches to be
considered fall into three categories: the semantic-episodic distinction,
levels of processing, and the debate over whether recognition involves
retrieval processes.

Episodic and semantic memory. Tulving's (1972) paper which bisects
memory and memory research has already enjoyed wide audience and influence.
Tulving offered the dichotomy in order to relate the traditional verbal
learning experiments, such as word lists and paired-associates, to the newer
semantic memory studies (e.g., Collins & Quillian, 1969; Schaeffer & Wallace,
1970). In essence, episodic memory is the system which stores perceptible
aspects of events in an autobiographically based, spatio-temporal reference
system. Semantic memory, on the other hand, stores formal, abstract
knowledge of the language and the world without autobiographical reference.
Tulving's distinction, then, is different from Atkinson's (e.g., Atkinson &
Juola, 1974) partitioning of the long-term store into a lexical store and
event knowledge store, in that Atkinson recapitulates the knowledge of the
language versus knowledge of the world or analytic versus synthetic
dichotomy while Tulving does not.

It would seem that Tulving's bisection might neatly separate sentence
lists and discourse. Sentence lists are, after all, lists of verbal
material, just like the word lists which are paradigmatic of episodic memory
research. Discourse, on the other hand, seems a likely candidate for storage
in semantic memory. In fact, Tulving cited the high incidence of
"intrusions" in the recall of a prose passage (Howe, 1970) as evidence of
the lack of retention of the surface form or perceptible stimulus properties
of a passage when its meaning is stored in semantic memory.

On closer examination, however, this type of neat separation breaks
down. In the first place, the same loss of surface form which Tulving finds
typical of semantic memory occurs with sentences in lists (e.g., Bock &
Brewer, 1974; Brewer, Note 3) as well as with discourse. Further, many of
the passages used by psychologists investigating discourse learning and
memory make storage in semantic memory extremely unlikely. It is just not
reasonable to expect subjects to add "information" about a fictitious South
American tribe or a serenade with electric guitar and balloons to their
knowledge of the world, devoid of any autobiographical reference. Thus,
retention of meaning after loss of surface form, and even errors of
inference (Binet & Henri, 1894) or pragmatic implication (Brewer, Note 4),
must be possible for episodic memory also. Further, the extent to which the
information in discourse, or, for that matter, sentence lists, gets stored
in semantic memory will depend on the extent to which it is perceived by
the subjects as bona fide knowledge of the world worthy of such storage.
That Tulving's dichotomy does not neatly divide sentence lists and connected
discourse should come as no surprise, since he cautioned that "the exercise
of identifying various memory situations with episodic or semantic memory
is neither simple nor particularly informative, since many tasks contain
both episodic and semantic features" (p. 368).

Levels of processing. Another major contribution to thinking about
subject processing is the levels of processing analysis of Anderson (1970,
1972) and Craik and Lockhart (1972; Craik, 1973). It is possible that the
level of processing at which subjects typically process sentences in lists
differs from the level of processing typical of discourse. In order to
consider this possible difference, we must first examine the levels of
processing notion.

The levels of processing concept seems to have grown naturally from
earlier thought in both experimental and educational psychology. In
reviewing the incidental learning literature, Mechanic (1962) and Postman
(1964) concluded that intention (to learn) per se was not important, but
that the type of processing required by the task was crucial. Posner (1969)
applied a levels of processing analysis to the abstraction of letter
stimuli. Bobrow and Bower (1969), while they did not use the term levels of
processing, did hypothesize that "comprehension of a sentence aids its
retention" and investigated incidental tasks designed to ensure "more
reliable comprehension than does simply reading a sentence" (p. 458).
Educational psychology has produced a line of research designed to explore
subject processing activities as determiners of learning from prose.
Rothkopf coined the term "mathemagenic activities" (1970), which means
activities which "give birth to learning" and is intended to emphasize that
meaningful processing (comprehension) is necessary for any significant
learning from prose to occur. Subject processing of text has been affected
by means of aids such as instructional objectives (e.g., Rothkopf & Kaplan,
1972) and inserted questions (see Anderson & Biddle, 1975, for review).
This paper will focus on the formulations of Anderson and of Craik and
Lockhart, but similar thinking has also been reflected in the work of others
such as Frase (1970), Hyde and Jenkins (1969, 1973), LaBerge and Samuels
(1974), MacKay (1973), Mistler-Lachman (1972, 1974), and Treisman and
Tuxworth (1974).

Craik and Lockhart's (1972) levels of processing has come out of the
verbal learning tradition. They propose levels of processing primarily as
an alternative to multistore models of memory (e.g., Atkinson & Shiffrin,
1968, 1971; Broadbent, 1958; Waugh & Norman, 1965). Craik and Lockhart's
levels may be grouped into stages: sensory analysis, pattern recognition,
and stimulus elaboration. Their levels of processing has been influenced
by recent accounts of perception (Selfridge & Neisser, 1960; Sutherland,
1968; Treisman, 1964). They use the term "depth" of processing, where deeper
processing means more semantic or cognitive processing. They do not,
however, insist that later stages of processing proceed in a fixed,
hierarchical order. In fact, they suggest that later stages of processing
might better be characterized as "spread" of encoding rather than depth.
Despite this mild disclaimer, however, they retain the term depth (implying
hierarchy) and postulate that "trace persistence is a function of the depth
of analysis, with deeper levels of analysis associated with more elaborate,
longer lasting, and stronger traces" (p. 675).

Anderson (1970, 1972) draws upon both the verbal learning and educational
psychology literatures in developing his levels of processing. He
tentatively outlines the levels (for printed material) as orthographic
encoding, phonological encoding, and semantic encoding. Semantic encoding
is the end product of comprehension, which has been characterized by
Anderson and Ortony (1975) as "constructing a particularized and elaborated
mental representation." Anderson's main concern is with this deepest level
of encoding because it is the level which characterizes a mature reader
reading text, and because it is the level of encoding required for the
attainment of reasonable and significant educational goals. He points out,
however, that both orthographic and semantic codes may be retained in
memory. While semantic encoding usually predominates with sentences and
discourse, so that the meaning and not the exact words themselves are
retained, it is possible to learn the exact words of a passage of text (the
Boy Scout Oath or Pledge of Allegiance) and even to learn a passage in an
uncomprehended foreign language (perhaps a song lyric).

The important commonality in the thinking of Anderson, Craik and
Lockhart, and the others is that they characterize processing as
hierarchical, with deeper levels of processing operating on the products of
shallower levels. Deeper levels are more cognitive or meaningful, and their
products are more easily or better retained in memory.

The question, for the purpose of this paper, is whether subjects are
more likely to comprehend or deeply analyze sentences when they appear in
discourse than when they occur in unrelated lists. For example, subjects
might be more inclined to try to comprehend discourse, and to rotely
memorize sentences in lists. If the levels of processing analysis is
correct, then several predictions are possible for both recall and
recognition. For recall, substance scoring should yield higher scores for
discourse than for sentence lists, as was typically found in the studies
reviewed above. A clear prediction about the absolute level of verbatim
recall is not possible, since verbatim recall may result not only from
retention of the exact words, but also from retention of the meaning, if it
is then re-expressed in the same words. Clearly, however, the proportion of
substance recall which is also verbatim should be lower for discourse than
for sentences. Discourse should produce more synonym substitutions, word
order changes, and other changes that do not substantially change meaning
than should sentence lists. In addition, errors of inference (Binet &
Henri, 1894) should be more common for discourse than for sentences in
lists. For recognition, it would be predicted that under neutral learning
instructions recognition for semantic content should be greater for
discourse than for sentence lists, but that the situation should be reversed
for the recognition of the verbatim form of the sentence. Thus, levels of
processing can predict Pezdek and Royer's (1973) result. Finally,
experimental manipulations designed to ensure meaningful encoding of the
sentences, such as imagery instructions or cover tasks which require
comprehension, should have less effect on discourse than on sentences in
lists, since the former is more likely to result in meaningful encoding in
the absence of such manipulations. Conversely, manipulations which
discourage comprehension, such as having subjects overtly rehearse the
material or monitor words for spelling errors, should have less effect on
sentence lists than on discourse. These predictions certainly merit
experimental investigation.

Craik and Lockhart's levels of processing approach has another area of
applicability to the comparison of sentence lists and discourse. They
suggest that primary memory, or "maintaining or recirculating information
at one level of processing" by paying "continued attention to certain
aspects of the stimulus" (1972, p. 676), be substituted for the short-term
store. When items are recirculated in primary memory, they are not subjected
to deeper levels of processing. Such recirculating can produce high
immediate recall of the final items in a word list (the recency effect), but
the lack of deep processing means that retention after the recirculating
stops suffers, as evidenced by the negative recency effect Craik (1970)
found on a final free recall after several lists. Craik and Watkins (1973,
Exp. II) have since shown that inserting a 20-second unfilled retention
interval, and thus greatly increasing the time in primary memory and the
number of overt rehearsals of the last few items over immediate recall,
does nothing to alleviate the negative recency effect.

Since sentence lists are unrelated lists, like word lists, subjects
may be expected to recirculate the last few items if they know that a
short-term memory recall test is impending. Subjects, however, should be
more likely to process connected discourse at a deep level throughout,
without employing a recirculating strategy near the end. deVilliers (1974)
observed that subjects who treated his passage as a sentence list tended to
output the last sentence or two first on recall, similar to the "dumping"
strategy observed in short-term memory experiments with word lists (e.g.,
Waugh & Norman, 1965). Deese and Kaufman (1957) had previously noted the
fact that subjects recalling discourse don't employ a dumping strategy.
They reported a study which compared discourse recall with recall of a word
list. For discourse, recall order was essentially perfectly correlated with
input order. With word lists, however, subjects tended to recall last
presented items first. Similar dumping has been found in the studies of
short-term memory for proverbs by Glanzer and Razel (1974).

Since subjects employ recirculating and dumping with sentence lists,
they should exhibit the marked short-term forgetting which occurs in the
recency portion of the serial position curve. Subjects recalling discourse,
however, should be much less affected by a filled retention interval because
the deeper level of processing applied to the final items should make them
more resistant to forgetting. Likewise, final free recall after a series
of sentence lists should produce the negative recency effect, while deeper
processing should prevent negative recency from occurring with discourse.

Retrieval processes in recognition. The final theoretical issue
that will be examined in an effort to illuminate the comparison of sentence
lists with discourse is the ongoing debate as to whether context or
organizational effects in recognition memory constitute evidence for
retrieval processes in recognition.

A widely held view in the verbal learning literature has been that
recall involves both a generation, retrieval, or search process and a
recognition test (based on familiarity, tagging, or matching with a
standard), of which only the latter is necessary for recognition (e.g.,
Adams & Bray, 1970; Anderson & Bower, 1972; Kintsch, 1970a; McCormack,
1972; Murdock, 1968). By this view, the trace stored in memory is
immediately or directly accessible (content addressable in computer
parlance) in recognition, so no search process is necessary. The opposing
view, championed chiefly by Tulving and his associates (e.g., Thomson,
1972; Tulving & Thomson, 1971, 1973; Watkins & Tulving, 1975) and Mandler
(1972), is that search or retrieval processes play an important role in
recognition as well as in recall. In other words, the memory trace is
assumed not to be directly accessible during recognition, necessitating a
search. Anderson and Bower's (1973) match process by which HAM recognizes
sentences is an example of such a search.

A common research strategy in this area has been to identify an
experimental variable which is believed to influence retrieval processes
exclusively and then see if recognition is affected by such manipulation.
If recognition is not affected, then the notion of searchless recognition
receives a measure of support. Organization was originally thought to be
such a variable. Cofer (1967), Bower (1968), Kintsch (1968), and Bruce and
Fagan (1970) reported studies using word or nonsense syllable lists which
showed that organizational manipulations which affected recall left
recognition unaffected. Since that time, however, the evidence that
organization and context affect recognition as well as recall has been
steadily amassing (e.g., Bower, Clark, Lesgold, & Winzenz, 1969;
D'Agostino, 1965; Franks & Bransford, 1974; Jacoby, 1972; Lachman & Tuttle,
1965; Light & Carter-Sobell, 1970; Light & Schurr, 1973; Mandler, 1972;
Thomson, 1972; Tulving & Thomson, 1971, 1973; Watkins & Tulving, 1975;
Winograd & Carr, 1971).

Tulving and Mandler argue from the evidence that organization or
context affects recognition that recognition includes a search process.
Another explanation, however, is also possible. The recognition effects can
also be explained if it is assumed that organization or context affects
encoding or storage and not retrieval. This argument is currently getting
much play (e.g., Light, Kimble, & Pellegrino, 1975; Martin, 1975; McCormack,
1972; Pellegrino & Salzburg, 1975; Reder, Anderson, & Bjork, 1974) based on
encoding variability (Martin, 1968) or stimulus sampling theory (Bower,
1972; Martin, 1972). Levels of processing (Anderson, 1970, 1972; Craik &
Lockhart, 1972) can also be used to explain the effects of context or
organization on recognition in terms of encoding differences.

With discourse, the effect of organizational variables on recognition
has received very little attention compared with recall. Most of the work
that has been done has come from Lachman and his associates (Dooling &
Lachman, 1971; Lachman & Dooling, 1968; Pompi & Lachman, 1967). This series
of studies has compared free recall to recognition using a rapidly paced
sorting task (recognize, don't recognize) with single word stimuli. The
organizational variable has been "syntactical" (normal discourse) order
versus random word order. In the Pompi and Lachman study, distractors were
of two types: high thematic relatedness (judged by an independent group of
subjects as likely to appear in an alleged short story of which the
experimental passage was a summary) and low thematic relatedness (items
selected in the same fashion for an unrelated story). No significant
difference obtained for hit rate or false alarm rate, but a highly
significant thematic relatedness x word order interaction occurred for the
false alarm rate, owing to the fact that subjects who read the syntactical
order made false positive errors much more frequently for the high thematic
than the low thematic distractors, while relatedness had no effect on the
false alarm rate for subjects who read the random order. Both Lachman and
Dooling and Dooling and Lachman used unrelated randomly selected
distractors. Lachman and Dooling varied training trials (2, 4, 6, or 8),
but no test of whether the order effect was significant is reported. The
recognition data are included along with recall data in a single analysis
that produced many interactions which go unanalyzed. Inspection of the
graph provided (of hit rates), however, reveals that if there was an order
effect for recognition it was due to the groups receiving two or four
training trials with the second of the two passages used. For the other
conditions an order effect seems most unlikely. In the final study in the
series, Dooling and Lachman add an intermediate level of organization,
random phrase order. They use "vague and metaphorical" passages which are
difficult or impossible to comprehend without a title and, orthogonally to
the level of organization, reveal the title to half the subjects and deny
it to the other half (as did Bransford and Johnson, 1972). Neither the
order nor the title main effect reached significance when hit rates or
false alarm rates were analyzed. Both main effects, however, reached
significance when difference scores (hits minus false alarms) were used.
The order x title interaction never reached significance. Recall (free
recall, verbatim word count) in all three studies was strongly affected by
order, with syntactical order producing better recall than random word
order. In Lachman and Dooling's study the order effect for recall increased
over trials. Dooling and Lachman's random phrase order produced a recall
level intermediate between random word order and syntactical order.
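
The difference measure (hits minus false alarms) mentioned above can be
sketched briefly. The counts are invented for illustration and are not
taken from Dooling and Lachman's data; the function name difference_score
is likewise an assumption. The point is only that small shifts in hit rate
and false alarm rate in opposite directions combine in the difference.

```python
def difference_score(hits: int, n_old: int,
                     false_alarms: int, n_new: int) -> float:
    """Hit rate minus false alarm rate for one subject or condition."""
    if n_old <= 0 or n_new <= 0:
        raise ValueError("need at least one old and one new test item")
    return hits / n_old - false_alarms / n_new


# Hypothetical counts out of 20 old and 20 new test words per condition.
syntactical = difference_score(hits=18, n_old=20, false_alarms=4, n_new=20)
random_order = difference_score(hits=15, n_old=20, false_alarms=7, n_new=20)
order_effect = syntactical - random_order
```

In this invented case the hit rates differ by .15 and the false alarm rates
by .15, while the difference scores differ by .30, which illustrates how an
effect that is unreliable on either rate alone might reach significance on
the combined measure.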

Clearly these studies provide some evidence for the effect of discourse
organization on recognition. The use of single word stimuli and a rapidly
paced sorting task, however, may have served to minimize this effect. Since
subjects tend to retain the meaning of the passage better than the exact
words used, sentences would be more suitable for testing recognition. Pezdek
and Royer (1973) used sentence stimuli to test for the detection of meaning
or wording changes for sentences presented with or without context. As
was noted above, however, the additional material presented in the context
condition provided a confounding that can be avoided by an order
manipulation.

An organizational factor that has not been investigated for recognition
is hierarchical importance in discourse (Kintsch & Keenan, 1973; Meyer &
McConkie, 1973). So far, research has shown that structurally more important
phrases or propositions are recalled better in a free recall task, but the
effect of structural importance on recognition needs to be tested in order
to determine the generality of the effect. Once again, presenting the
material in scrambled and discourse order would permit the unconfounding
of the material itself with structural importance.

Although organizational or context effects do not necessarily support
the notion of an important role for retrieval processes in recognition,
there is another line of evidence which is somewhat more convincing.
Reaction time studies beginning with Sternberg (1969a, 1969b) have shown
that the time to decide whether or not an item is a member of a target set
depends on the size (number of members) of the target set. Although the
target sets were quite small in the original Sternberg studies (1-6 one
place digits), the effect of set size has proved to have surprising
generality. Atkinson and Juola (1973, 1974; also Atkinson, Herrmann, &
Wescourt, 1974) have obtained the size effect with target sets of 30 words
or more. Landauer and Freedman (1968) and others (Landauer & Meyer, 1971;
Meyer, 1970; Wilkins, 1971) have demonstrated the size effect for
preexisting semantic categories by measuring the time it takes to decide
whether a word (e.g., collie) belongs to a semantic category (e.g., dogs,
animals).

Recently, Kintsch (1974) has extended the size effect to discourse
memory. Using factual passages of 20, 40, 80, or 120 words, Kintsch found
that the time required to make an affirmative response on a true-false or
"verbatim" recognition test was an increasing linear function of passage
length. Negative response latencies were not significantly affected by
passage length.

It is odd that Kintsch should measure passage length by words, in view
of his commitment to a propositional representation of discourse content.
If the size effect proves to be reliable for discourse, it will provide
another method for the comparison of alternative methods of representing
and measuring discourse content. By constructing materials such that the
number of units on two different measures of content are orthogonal to
each other, the two measures could be compared. For example, four passages
might be constructed with size specified by number of words and number of
propositions:




     Passage     Words

        1
        2          w
        3        4/3w
        4        4/3w

The measure that demonstrated the larger effect would be competitively
supported relative to the other measure. Initially, propositions might be
compared to words or sentences. A more interesting comparison, however,
would be to compare the n-ary propositions of Kintsch to the binary
"propositions" of HAM (Anderson & Bower, 1973).
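
One way such a comparison might be carried out is sketched below. The
design values (60 or 80 words crossed with 15 or 20 propositions) and the
latencies are invented for illustration and are not Kintsch's data. Using
the correlation of each size measure with latency as the index of "larger
effect" is also an assumption, chosen because raw slopes per word and per
proposition are in different units and cannot be compared directly.

```python
from math import sqrt
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson product-moment correlation between two equal-length series."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical 2 x 2 design: each passage has w or 4/3 w words,
# crossed with p or 4/3 p propositions (here w = 60, p = 15).
words = [60, 60, 80, 80]
propositions = [15, 20, 15, 20]

# Hypothetical mean affirmative latencies (seconds) for the four passages.
latency = [1.50, 1.70, 1.52, 1.72]

design_overlap = pearson_r(words, propositions)   # zero if truly orthogonal
r_words = pearson_r(words, latency)
r_propositions = pearson_r(propositions, latency)
better_measure = "propositions" if r_propositions > r_words else "words"
```

Because the two size measures are uncorrelated across passages in this
design, whatever latency variance one measure accounts for cannot be
attributed to the other. The same sketch would apply to a comparison of
n-ary and binary propositions, provided passages could be written in which
the two counts vary independently.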

Conclusion 

The purpose of this paper was to compare sentence lists and connected
discourse in order to determine the relationship between these two types of
materials. A brief review of the two literatures revealed that for both
sentence lists and discourse, memory for meaning typically exceeds and
outlasts memory for words or syntactical form, and that memory for both
types of materials has been shown to involve constructive or inferential
processes. Thus, important similarities between sentence lists and discourse
do unquestionably exist. Unfortunately, the question of the existence of
important differences between the two types of materials cannot be so
confidently answered. A review of the existing literature which compares
sentence lists and discourse primarily revealed that discourse is usually
more easily learned or better remembered. The lack of theoretical
motivation in most of these studies limited the search for differences to
comparing amount recalled. A few important exceptions, however, did emerge.
Thus, Anderson (1974) found that whether the voice of a test sentence
matched its target affected verification latencies with delayed testing for
sentences in lists but not in discourse. Pezdek and Royer (1973) found that
supplying paragraph context for a target sentence aided detection of
meaning changes while, perhaps, hindering detection of wording changes.
deVilliers (1974) observed that subjects who perceived his passage as a
sentence list tended to recall the last sentence or two first, while
subjects who perceived it as a connected story recalled it in the order it
was presented.

These studies suggest that important differences between sentence lists
and discourse may indeed exist, in addition to differences in amount
recalled. However, due to the embarrassing lack of empirical evidence on
the issue, the nature and extent of such other differences remain unknown.
It is hoped that this paper has helped to point the way to filling this
void in our knowledge of human learning and memory. The question of what
sorts of differences might be found has been examined in the broad context
of stimulus differences between the two types of materials and in terms of
possible differences in the way subjects process them. Suggestions of
possible differences and of methods for experimentally investigating them
have been offered. No detailed theoretical account of the differences
between sentence lists and discourse was provided; indeed, in view of the
present state of our ignorance on this matter, such an undertaking would be
premature. The psychologists' careful, scientific, experimental study of
human verbal learning and memory has put much time and effort into the
study of nonsense syllables, word lists, and paired-associates. Only
recently have sentences and discourse begun to claim their fair share of
the psychologists' interest and study. While we are amassing quite an
impressive literature and body of data with both types of materials, our
understanding of their relationship lags behind. It is hoped that the
analysis provided in this paper, and especially the investigation of the
experimental questions posed in it, will help move us closer to
understanding how people comprehend, learn, and remember sentences, both in
lists and in discourse.